Correlation - Based and Contextual Merit - BasedEnsemble Feature

نویسندگان

  • Seppo Puuronen
  • Alexey Tsymbal
  • Iryna Skrypnyk
چکیده

Recent research has proved the beneets of using an ensemble of diverse and accurate base classiiers for classiication problems. In this paper the focus is on producing diverse ensembles with the aid of three feature selection heuristics based on two approaches: correlation and contextual merit-based ones. We have developed an algorithm and experimented with it to evaluate and compare the three feature selection heuristics on ten data sets from UCI Repository. On average, simple correlation-based ensemble has the superiority in accuracy. The contex-tual merit-based heuristics seem to include too many features in the initial ensembles and iterations were most successful with it.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Use of Randomization to Normalize Feature Merits

Feature merits are used for feature selection in classi cation and regression as well as for decision tree generation. Commonly used merit functions exhibit a bias towards features that take a large variety of values. We present a scheme based on randomization for neutralizing this bias by normalizing the merits. The merit of a feature is normalized by division by the expected merit of a featur...

متن کامل

A Model for Detecting of Persian Rumors based on the Analysis of Contextual Features in the Content of Social Networks

The rumor is a collective attempt to interpret a vague but attractive situation by using the power of words. Therefore, identifying the rumor language can be helpful in identifying it. The previous research has focused more on the contextual information to reply tweets and less on the content features of the original rumor to address the rumor detection problem. Most of the studies have been in...

متن کامل

The Role of Biomedical Dataset in Classification

In this paper, we investigate the role of a biomedical dataset on the classification accuracy of an algorithm. We quantify the complexity of a biomedical dataset using five complexity measures: correlation-based feature selection subset merit, noise, imbalance ratio, missing values and information gain. The effect of these complexity measures on classification accuracy is evaluated using five d...

متن کامل

Comparison Between Different Methods of Feature Extraction in BCI Systems Based on SSVEP

‎There are different feature extraction methods in brain-computer interfaces (BCI) based on Steady-State Visually Evoked Potentials (SSVEP) systems‎. ‎This paper presents a comparison of five methods for stimulation frequency detection in SSVEP-based BCI systems‎. ‎The techniques are based on Power Spectrum Density Analysis (PSDA)‎, ‎Fast Fourier Transform (FFT)‎, ‎Hilbert‎- ‎Huang Transform (H...

متن کامل

A General Investigation on the Combination of Local and Global Feature Selection Methods for Request Identification in Telegram

Nowadays, the use of various messaging services is expanding worldwide with the rapid development of Internet technologies. Telegram is a cloud-based open-source text messaging service. According to the US Securities and Exchange Commission and based on the statistics given for October 2019 to present, 300 million people worldwide used telegram per month. Telegram users are more concentrated in...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007